Top-K High Utility Episode Mining from a Complex Event Sequence

نویسندگان

  • Sonam Rathore
  • Siddharth Dawar
  • Vikram Goyal
  • Dhaval Patel
چکیده

Utility episode mining has emerged as an interesting and challenging research topic in data mining. It finds applications in anomaly detection, biomedical data analysis, predicting stock trends etc. The number of high-utility episodes that can be extracted from a sequence depends upon the value of minimum utility threshold. It is often difficult for a user to find a suitable threshold value which fits their purpose. The sequence can generate many high-utility episodes at low threshold value and very few episodes at higher threshold values. In order to relieve the user from this tedious task, we propose an algorithm for mining top-k high utility episodes from a complex event sequence. The parameter k can be set by the user according to his/her needs. We also propose effective strategies for raising the threshold value in order to prune the search space effectively. We conduct extensive experiments on real and synthetic datasets and the experimental results demonstrate the effectiveness of our proposed strategies in terms of total execution time and the number of candidate episodes generated.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

High Fuzzy Utility Based Frequent Patterns Mining Approach for Mobile Web Services Sequences

Nowadays high fuzzy utility based pattern mining is an emerging topic in data mining. It refers to discover all patterns having a high utility meeting a user-specified minimum high utility threshold. It comprises extracting patterns which are highly accessed in mobile web service sequences. Different from the traditional fuzzy approach, high fuzzy utility mining considers not only counts of mob...

متن کامل

Discovery of Frequent Episodes in Event Logs

Lion’s share of process mining research focuses on the discovery of end-to-end process models describing the characteristic behavior of observed cases. The notion of a process instance (i.e., the case) plays an important role in process mining. Pattern mining techniques (such as frequent itemset mining, association rule learning, sequence mining, and traditional episode mining) do not consider ...

متن کامل

Mining Frequent Elliptic Episodes from Event Sequences

In this paper, we formulate an episode in episode mining as an acyclic transitive labeled digraph. Also we introduce an elliptic episode s → [S1, . . . , Sn] → t such that s and t are event types, [S1, . . . , Sn] is a sequence of serial episodes, and every pair of Si and Sj shares no same event type. Then, we show that every elliptic episode is constructible from serial episodes. Next, in this...

متن کامل

Mining Frequent Diamond Episodes from Event Sequences

In this paper, we introduce a diamond episode of the form s1 → E → s2, where s1 and s2 are events and E is a set of events. The diamond episode s1 → E → s2 means that every event of E follows an event s1 and is followed by an event s2. Then, by formulating the support of diamond episodes, in this paper, we design the algorithm FreqDmd to extract all of the frequent diamond episodes from a given...

متن کامل

Discovering general partial orders in event streams

Sequence of time-ordered events arise in a variety of applications like customer transaction databases, alarm sequences in telecommunication networks, fault logs in manufacturing plant data, web interaction logs, etc. A popular framework for temporal pattern extraction from such data is the frequent episode discovery paradigm. An episode is a set of nodes with a partial order prescribed on it, ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016